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The envelope glycoprotein, termed the spike protein, of severe acute 
respiratory syndrome coronavirus (SARS-CoV) is known to mediate 
viral entry. Similar to other class 1 viral fusion proteins, the heptad 
repeat regions of SARS-CoV spike are thought to undergo conforma- 
tional changes from a prefusion form to a subsequent post-fusion form 
that enables fusion of the viral and host membranes. Recently, the 
structure of a post-fusion form of SARS-CoV spike, which consists of 
isolated domains of heptad repeats 1 and 2 (HR1 and HR2), has been 
determined by x-ray crystallography. To date there is no structural 
information for the prefusion conformations of SARS-CoV HR1 and 
HR2. In this work we present the NMR structure of the HR2 domain 
(residues 1141-1193) from SARS-CoV (termed S2-HR2) in the pres- 
ence of the co-solvent trifluoroethanol. We find that in the absence of 
HR1, S2-HR2 forms a coiled coil symmetric trimer with a complex 
molecular mass of 18 kDa. The S2-HR2 structure, which is the first 
example of the prefusion form of coronavirus envelope, supports the 
current model of viral membrane fusion and gives insight into the 
design of structure-based antagonists of SARS. 


In 2003, an atypical pneumonia, now known as severe acute respira- 
tory syndrome (SARS),* caused a deadly worldwide outbreak originat- 
ing in Guangdong province of China, eventually spreading to ~30 coun- 
tries and infecting ~8000 people with a mortality rate of 10% (1). SARS 
is a respiratory illness caused by the newly discovered human corona- 
virus termed SARS-CoV, bearing low sequence homology to other 
members in the Coronaviridae family and therefore classified as a new 
group, group IV (2-5). 

Coronaviruses belong to a family of enveloped viruses, which use 
extracellular glycoproteins to mediate the fusion of the viral and cellular 
membranes. Other members of this group of viruses include: influenza 
virus hemagglutinin (HA), Ebola virus glycoprotein, human immunod- 
eficiency virus (HIV) gp160, and parainfluenza virus HN and F (6-8). 
Such viral envelope glycoproteins consist of 2 extracellular domains: a 
receptor-binding domain anda domain that mediates fusion of the virus 
and target membranes (9). In SARS-CoV, the receptor-binding domain 
is termed S1, and the fusion domain is termed 82. 

The viral fusion proteins, including the SARS-CoV S82, are believed to 
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share a similar mechanism for viral entry and fusion (reviewed in Refs. 7, 
10, and 11). Although the details of SARS-CoV entry are still unclear, it 
is believed that following binding to the host cell receptor, the angioten- 
sin-converting enzyme 2 (12, 13), S2, the fusion domain, undergoes a 
series of structural changes from a prefusion conformation to a post- 
fusion conformation. During these structural changes a fusion peptide 
within $2 is exposed and inserted into the membrane of the target cell. 
Within $2, there are two heptad repeat domains: HR1 and HR2. The 
post-fusion form of the fusion domain is relatively well characterized by 
structural biology. For example, high resolution structures have been 
determined for HIV gp41 (14-16), simian immunodeficiency virus 
gp41 (17-19), influenza virus HA (20), Ebola virus glycoprotein (21, 22), 
mouse hepatitis virus (23), SARS-CoV (24, 25), and parainfluenza virus 
F (26). Note that all of these structures represent the so called six-helix 
bundle form or fusion core, where the N-terminal helices, HR1, form a 
trimeric coiled coil core surrounded by the C-terminal helices, HR2, 
oriented in an antiparallel fashion. This conformation is thought to 
allow the target membrane to come into close contact with the viral 
membrane, ultimately promoting membrane fusion and viral entry. In 
the prefusion state, the HR2 region is thought to self-associate to form a 
trimeric coiled coil. Evidence for the prefusion state is based on studies 
of influenza HA (27, 28) and more recently parainfluenza F (29). Further 
evidence comes from studies of peptides from HIV gp41 and parainflu- 
enza F corresponding to the HR1 and HR2 regions. These peptides are 
thought to inhibit viral fusion by binding to the prefusion conformation 
and thereby block the formation of the HR1/HR2 six-helix bundle that is 
necessary for membrane fusion (7, 30-33). One of the peptides, corre- 
sponding to the HR2 region of gp41, termed T20 or fuzeon, is currently 
used as a drug against HIV (34, 35). Interestingly, analogous peptides 
from SARS-CoV have also been shown to exhibit antiviral activity, 
although at significantly higher concentrations than T20 (36-39). 
Recent studies have shown that SARS-CoV $2-HR2 in isolation, 
unlike the corresponding domain from HIV gp41, forms a helical struc- 
ture in solution based on circular dichroism studies and, further, that 
the predominant species present is that of a trimer (40). However, a 
detailed three-dimensional structure of the SARS-CoV $2-HR2 has not 
yet been available. In this work, we present the solution structure of the 
timeric SARS-CoV-S2-HR2 in the presence of the co-solvent TFE, 
which was necessary to solubilize the construct. We find that $2-HR2 
forms asymmetric coiled coil trimer in solution. Importantly, this struc- 
ture provides the first structural evidence for the prefusion state of 
SARS-CoV envelope. Moreover, the S2-HR2 structure provides a 
framework for the design of novel peptides that inhibit viral entry. 


EXPERIMENTAL PROCEDURES 


Expression and Purification—The SARS-CoV $2-HR2 domain consist- 
ing of residues 1141-1193 was subcloned into the BamHI/HindIII restric- 
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FIGURE 1. **N-edited HSQC of SARS-CoV S2-HR2. The sample conditions were 1 mm S2-HR2 monomer, 10 mm NaPO,, pH 7.0, 30% TFE-d3 at 25 °C. Horizontal lines connect the side 


chain amide proton pairs of asparagine and glutamine residues. 


tion sites of a modified pQE30 expression vector (Qiagen). The resulting 
construct, termed His-PG-S2-HR2, consists of a N-terminal polyhistidine 
tag followed by protein G (the IgG-binding domain of streptococcus pro- 
tein G) (41), a tobacco etch virus cleavage site (sequence, ENLYFQGS) (42) 
for removal of the expression tag, and $2-HR2. For simplicity, from here on 
S2-HR2 will be numbered 1—55 corresponding to residues 1141-1193 of 
the intact protein (the N-terminal glycine and serine residues are a cloning 
artifact). Protein expression of '°N-labeled and '°C/'°N-labeled S2-HR2 
was achieved by growing Escherichia coli strain SG13009 as previously 
described (43). Briefly, the cells were grown in 4 liters of LB medium at 
37 °C until they reached an Ago, of 0.8. Subsequently, the cells were pel- 
leted, washed one time with M9 minimal medium, resuspended in 1 liter of 
M9 minimal medium supplemented with 1 g/liter of ‘"NH,Cl (Martek 
Biosciences, Columbia, MD) and 4 g/liter of '*C-glucose (Isotec, Miamis- 
burg, OH), set to recover for 1 h at 37 °C, induced with 0.8 mM isopropyl 
B-D-thiogalactopyranoside, and grown for an additional 4—5 h at 37°C. 
The His-PG-S2-HR2 fusion protein was purified from the soluble fraction 
using a Ni?" fast flow Sepharose column (Qiagen). The protein was then 
cleaved using tobacco etch virus protease and run once more over the Ni** 
column to remove His-PG and tobacco etch virus protease, which also 
contains a polyhistidine tag. The flow through fraction containing S2-HR2 
was then dialyzed extensively against 10 mM NaPO,, pH 7.0, and concen- 
trated by ultrafiltration (YM3; Amicon, Billerica, MA). The purity and iden- 
tity of S2-HR2 were confirmed by SDS-PAGE and matrix-assisted laser 
desorption ionization time-of-flight mass spectrometry. 
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FIGURE 2. Secondary structure of SARS-CoV S2-HR2. a, secondary chemical shift of 
S2-HR2 '3C,, (gray) and '3C’ (white) with respect to random coil values. Random coil 
values were taken from Wishart and Case (64). 6b, HNOE of S2-HR2. 
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FIGURE 3. CLEANEX-PM '°N-edited HSQC. The sample conditions were 1 mm of S2-HR2 monomer, 10 mm NaPO,, pH 7.0, 30% TFE-d3 at 25 °C with a mixing time of 100 ms. Note that 
residues Thr? and Ser?, which occur in an unstructured region at the N terminus, exhibit negative contours, presumably because of ROE effects and/or intermolecular NOEs with H,O 
(49). The absence of a correlations for residues 17-47 suggests that they are involved in H-bonds. The absence of correlations for residues 14-16 and 50 may suggest the presence 
of H-bonds. The other missing correlations include those of residues 1-3, which are also missing from the '°N-edited HSQC (residue 6 is a proline). 


NMR Spectroscopy—For the NMR experiments, the experimental 
conditions were 1 mM S2-HR2 in 10 mm NaPO,, pH 7.0, 30% TFE-d3 
(Cambridge Isotope Laboratories, Andover, MA), and 10% 7H,O. NMR 
spectra were recorded at 25°C on a Buker AVANCE 800 MHz spec- 
trometer equipped with a cryogenic triple resonance probe. The spectra 
were processed by NmrPipe and visualized with NmrDraw (44). Back- 
bone assignments were determined using a standard set of three-di- 
mensional heteronuclear NMR experiments including: HNCO, CBCA- 
(CO)NH and CBCANH (45, 46). The side chain resonances were 
assigned by three-dimensional '’N-edited TOCSY-HSQC, three- 
dimensional HCC(CO)NH, and three-dimensional CC(CO)NH exper- 
iments. A three-dimensional '"N-edited NOESY-HSQC (mixing time, 
120 ms) was acquired for structural restraint information as well as 
confirmation of side chain assignments. An additional HNHA experi- 
ment was performed to add *J,;.,,, information (47). The intersubunit 
NOEs were identified by recording a three-dimensional F1-filtered, 
F2-edited 'H-'*°C NOESY-HSQC with a mixing time of 120 ms (48) on 
a 1 mM sample comprised of a mixture of 50% ‘*N/'*C-labeled and 50% 
*°N/'°C-labeled monomers. Exchangeable amide protons were identi- 
fied by the CLEANEX-PM experiment with a mixing time of 100 ms 
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(49). HNOE values were measured by a standard pulse sequence 
(reviewed in Ref. 50). The HNOE values for each residue were calculated 
as the intensity ratio (I/Iy) of the ‘7N-'H correlation peak in the presence 
(J) and absence (/,) of proton saturation during the relaxation delay of 
5 s. Spectral processing and analysis was performed using the program 
NMRPipe (44). 

Structure Determination—NOEs were manually assigned and classi- 
fied as: strong (1.8—2.7 or 1.8-2.9 A for the NOE of NH), medium 
(1.8—3.3 or 1.8—3.5 A for NOE of NH), weak (1.8—5.0 A), or very weak 
(1.8—6.0 A). For distances involving methyl protons, 0.5 A was added to 
the upper limit to account for the higher apparent intensity of methyl 
protons. and ¢ torsion angle restraints were derived from “J,;,;, (51) 
and a data base analysis of chemical shifts (N, Hy, C,, Cg, C’, H,) using 
the program TALOS (52). Minimum error ranges for & and ¢ were set 
to +40 and £20, respectively. H-bonds, which were identified by the 
"SC backbone chemical shifts, short range NOE patterns, and nonex- 
changeable protons, were incorporated for the helical region, residues 
17—48, as two restraints per H-bond where ryyy.o = 1.5-2.8 Aand ry 4 = 
2.4—3.5 A. The structures were calculated by simulated annealing in 
torsion angle space (53) starting from three extended strands, followed 
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TABLE 1 
Structural statistics 


| 


pie 
y L30HB 
L37HB 


 L41HB1 
4 L41HB2 


Shown are the structural statistics for the final 30 simulated annealing structures (<SA>) and the minimized mean (<SA>,). None of the structures exhibit distance 
violations greater than 0.5 A, dihedral violations greater than 10°, or coupling constant violations greater than 2 Hz. The number of restraints/subunit is given in parentheses. 


Root mean square deviation from distance restraints 
Total (344) 

Intraresidue (57) 

Sequential (| i — j| = 1 (153) 

Short range (1 < (|i — j| < 5) (49) 

Intermolecular (85) 

Root mean square deviations from dihedral restraints (deg) (110)* 
Root mean square deviations from /,,,,, coupling (Hz) (19) 
Deviations from Idealized Covalent Geometry 
Bonds (A) 

Angels (deg) 

Improper torsion (deg) 

Measures of Structure Quality’ 

Residues in most favorable region (%) 

Residues in additionally allowed region (%) 

Residues in generously allowed region (%) 

Residues in disallowed region (%) 

Number of bad contacts/100 residues 

Coordinates precision (A)* 

Backbone residues 17—47 

Heavy residues 17-47 


“ Torsion angle restraints consisted of 51 ¢, 51 w, 4 v1, and 4 y2 per subunit. 
Overall quality of the structure as determined by the program PROCHECK (58). 


<SA> <SA>, 
0.014 + 0.0020 0.012 
0.004 + 0.003 0.003 
0.010 + 0.003 0.008 
0.014 + 0.003 0.014 
0.024 + 0.004 0.019 
0.19 + 0.14 0.16 
0.449 + 0.09 0.539 
0.0018 + 0.00016 0.0016 
0.389 + 0.022 0.360 
0.339 + 0.026 0.317 
88.1 + 3.2 89.8 
11.5 +3.5 8.2 
04+ 0.8 2.0 
0.0 + 0.0 0.0 
4.8 + 3.3 6.0 
0.46 + 0.17 
1.04 + 0.15 


“ Defined as average root mean square difference between the ensemble structures and the mean structure. 


by conventional simulated annealing (54), using the program CNS 
(55). The program was adapted to incorporate a conformational data 
base (56) and a pseudopotential term for noncrystallographic sym- 
metry (17, 57). A family of the 30 lowest energy structures was cho- 
sen, and a minimized mean structure was calculated. The overall 
quality of the final structures was assessed using the program PRO- 
CHECK (58). The figures were generated using the program MOL- 
MOL (59). 


RESULTS AND DISCUSSION 


Optimization of Experimental Conditions—Previous analytical ultra- 
centrifugation studies have demonstrated that SARS CoV $2-HR2 is a 
trimer (40). Accordingly, our initial NMR experiments on $2-HR2 were 
performed under varying conditions of pH and salt to optimize the 
solubility of the construct. However, at protein concentrations greater 
than 400 uM, which are typically necessary for NMR structural studies, 
S2-HR2 tends to form higher molecular mass species (larger than that of 
a trimer, 18 kDa) as evidenced by the presence of additional broad peaks 
in the '°N-edited HSQC (data not shown). The additional broad peaks 
of the higher molecular mass species are only observed at relatively high 
concentrations and therefore believed to be nonphysiologically rele- 
vant. The co-solvent TFE has previously been used to break up aggre- 
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gates formed between coiled coil complexes without breaking up the 
true coiled coil quaternary structure (60). By the addition of 30% TFE, 
we were able to obtain NMR spectra with one set of resonance peaks at 
a concentration of 1 mM, which was suitable for high resolution struc- 
tural studies. 

Resonance Assignment and Secondary Structure—A two-dimen- 
sional '°N-edited HSQC spectrum of $2-HR2 (buffer conditions 10 mm 
NaPO,, pH 7.0, 30% TFE-d3, 10% *H,O) is shown in Fig. 1. The back- 
bone assignments were obtained using the standard set of three-dimen- 
sional triple resonance experiments as described under “Experimental 
Procedures” (the resonance assignments have been deposited in the 
Biological Magnetic Resonance Databank under entry 6969). The 
*°N-'H assignments are shown in Fig. 1. The °C’ and '°C,, chemical 
shifts with respect to random coil values are shown in Fig. 2a. The 
secondary chemical shift suggests the presence of helix from residues 17 
to 51. As shown by Fig. 2b, the HNOE profile further supports the 
presence of the helical region encompassing this region. For example, 
residues 3—16 and 52—55 clearly exhibit higher mobility with respect to 
the central region. The presence of exchangeable amide protons was 
next probed by CLEANEX-PM experiments (49). As shown in Fig. 3, the 
amides of residues 7—13, 49, and 51-55 are in relatively fast exchange 
with solvent. Based on the '*C chemical shifts, HNOE, and amide 


VOLUME 281*NUMBER 17*APRIL 28, 2006 


SIOZ ‘L Wore, UO TQDO-OleWO UsIE}S9 AA ATU) 18 /S10°0qr MMM//:duny Wor, paprojuUMoqg 


Solution Structure of the SARS-Coronavirus HR2 Domain 


FIGURE 5. Solution structure of SARS-CoV S2- 
HR2. a, ensemble of 30 low energy structures of 
showing the superimposition of the backbone 
atoms. b, ribbon representation of the minimized 
mean structure. c, electrostatic map of the mini- 
mized mean structure. In a and 6, subunits A, B, and 
C are colored red, green, and blue, respectively. The 
direction of the viral membrane is shown by an 
arrow. 


exchange experiment, as well as the short range NOE patterns, S2-HR2 
consists of a continuous helix from residues Asn*’ to Leu*”’. 

Structure Determination—The calculation of a symmetric trimer 
using NMR is a challenging task and requires the use of isotopic-filtered 
NOE experiments (reviewed in Ref. 61). In the present work, intermo- 
lecular NOEs were identified by a three-dimensional F1-filtered F2-ed- 
ited ‘H-'°C NOESY-HSQC (48) experiment recorded on a 1:1 mixture 
of °C/'°N:'7C/"*N-labeled S2-HR2. Based on previous analytical ultra- 
centrifugation experiments, S2-HR2 is a trimer; hence one is not able to 
distinguish between intermolecular NOEs for example between subunit 
A and B or subunit A and C. We therefore assigned the NOEs in sucha 
way that a proton on one subunit is allowed to interact with protons of 
the other subunits as ({r °)~'’° sums (17). Examples of the quality of 
the filtered NOE spectrum are shown for selected residues in Fig. 4. In 
total, all intermolecular NOEs were found between residues in the a, d, 
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FIGURE 6. Structural features of SARS-CoV S2- 
HR2. a, intermolecular contacts between the S2-HR 
helices. b, helical wheel representation of S2-HR2 
residues Asn'’—-Leu*’” looking down the helical axis, 
starting at the N-terminal end. There are seven res- 
idues/heptad where the individual positions of 
the seven residues are denoted by the /etters a-f. 
The residues in the a and d positions make up the 
hydrophobic interface of the trimeric coiled coil of 
the S2-HR2 structure. The a positions are high- 
lighted purple, and the d positions are highlighted 
green. 


e, and g positions of the helical wheel, consistent with the interface of a 
trimeric coiled coil structure, to be discussed below. The solution struc- 
ture of S2-HR2 was determined using information from a total of 344 
NOEs (including 85 intermolecular NOEs), 30 H-bonds, 19 J-coupling 
constants, and 110 dihedral restraints/subunit (Table 1). The structure 
statistics of the family of the 30 lowest energy structures and the final 
minimized mean structure are summarized in Table 1. The superposi- 
tion of the backbone of the final 30 lowest energy simulated annealing 
structures is shown in Fig. 5a. The final ensemble shows no NOE viola- 
tions over 0.5 A and no dihedral angles in the disallowed regions of the 
Ramachandran plot (Table 1). The root mean square deviation of resi- 
dues Asn*’—Leu*’, which comprise the helical regions, relative to the 
mean is 0.46 A for the backbone atoms and 1.04 A for the heavy atoms. 

S2-HR2 Solution Structure—A ribbon representation of the mini- 
mized mean structure of 52-HR2 is shown in Fig. 5d. In this represen- 
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tation, subunits A, B, and C are colored red, green, and blue, respectively. 
S2-HR2 forms a symmetric trimeric coiled coil with an overall length of 
~96 A and width of ~13 A. There are extensive contacts between the 
three subunits with ~6150 A” buried upon complex formation. Inter- 
estingly, the side chains of the two conserved glycosylation sites of HR2, 
Asn’ and Asn’*, are exposed to the surface as expected. A surface 
electrostatic representation of $2-HR2 shown in Fig. 5c reveals that the 
charge is evenly distributed along the entire helix. The main interface of 
the S2-HR2 coiled coil trimer was determined to include residues 
Asn’*’—Leu””’. 

A coiled coil structure is a bundle of a-helices that interact with each 
other in such a way that they will form a left-handed superhelix. Resi- 
dues in the a and d positions are often hydrophobic residues that will 
interact with each other to form the hydrophobic core. This packing is 
often referred to as knobs-into-holes and is the hallmark for a true coiled 
coil structure (62, 63). The packing of the amino acid side chains within 
S2-HR2 coiled coil is shown in Fig. 6. The hydrophobic residues in the a 
position (Val, Ile”’, Ala**, and Leu*’) are colored purple, and those in 
the d position (Ile*’, Leu*®, Leu®’, and Leu®*) are colored green to show 
the relative orientation and positioning. The stacking of these residues is 
arranged in a way that creates a tightly packed core within the S2-HR2 
trimer with the hydrophobic interface buried. Positions e and g within a 
heptad repeat usually contain charged amino acids placed in such a way 
that the charges are complementary to each other to allow electrostatic 
interaction. In the case of S2-HR2, there is only one charged amino acid, 
in position g (Glu*°), and therefore it is not likely that the e and g 
positions play important roles in the stability of the $2-HR2 trimer. On 
the other hand, positions b, c, and f are surface exposed in the coiled coil 
structure and generally encompass polar amino acids that contribute to 
the solubility of the protein. In the coiled coil region of S2-HR2, we note 
that there are two surface exposed hydrophobic residues, Val?" and 
Ile**, that are conserved among coronaviruses. Interestingly, in the post- 
fusion structure (25), Val** and Ile*” form contacts with the HR1 that 
presumably stabilize the six-helix bundle conformation. 

Conclusions—In conclusion SARS-CoV S2-HR2 forms a coiled coil 
structure in solution consisting of three helices coming together to form 
a parallel trimer. As mentioned in the introduction, SARS-CoV S82 is 
thought to exist in two conformations: a prefusion conformation in 
which HR1 and HR2 do not interact with each other and a fusion con- 
formation in which HR2 interacts with the trimeric coiled coil of HR1 in 
an antiparallel fashion to form the six-helix bundle, which was also 
observed for HIV and simian immunodeficiency virus (SIV) gp41 and 
Ebola glycoprotein. In light of our results, it is enticing to speculate that 
the present structure represents a prefusion intermediate of the S2-HR2 
region, which has never before been observed for coroniviridae. Impor- 
tantly, the information provided in this study may be exploited for the 
design of novel antivirals. For example, the relatively poor antiviral 
activity of SARS-CoV S2-HR2 may be a consequence of its tendency 
to form the trimeric coiled coil, and thus one potential strategy 
would be to mutate residues that are important to the formation of a 
HR2 trimer while at the same time maintaining the interactions 
necessary for the HR2-HR1 interaction. 
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